Overview
Brought to you by YData
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 9986 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 MiB |
| Average record size in memory | 168.0 B |
Variable types
| Text | 6 |
|---|---|
| DateTime | 2 |
| Categorical | 7 |
| Numeric | 6 |
Country has constant value "united states" | Constant |
Category is highly overall correlated with Sub-Category | High correlation |
Discount is highly overall correlated with Profit and 1 other fields | High correlation |
Postal Code is highly overall correlated with Region and 1 other fields | High correlation |
Profit is highly overall correlated with Discount and 2 other fields | High correlation |
Profit Margin is highly overall correlated with Discount and 1 other fields | High correlation |
Region is highly overall correlated with Postal Code and 1 other fields | High correlation |
Sales is highly overall correlated with Profit | High correlation |
State is highly overall correlated with Postal Code and 1 other fields | High correlation |
Sub-Category is highly overall correlated with Category | High correlation |
Discount has 4793 (48.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-07-17 08:50:47.692656 |
|---|---|
| Analysis finished | 2025-07-17 08:50:53.343545 |
| Duration | 5.65 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
Order ID
Text
| Distinct | 5009 |
|---|---|
| Distinct (%) | 50.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 2540 ? |
|---|---|
| Unique (%) | 25.4% |
Sample
| 1st row | CA-2014-100006 |
|---|---|
| 2nd row | CA-2014-100090 |
| 3rd row | CA-2014-100090 |
| 4th row | CA-2014-100293 |
| 5th row | CA-2014-100328 |
| Value | Count | Frequency (%) |
| ca-2017-100111 | 14 | 0.1% |
| ca-2017-157987 | 12 | 0.1% |
| us-2016-108504 | 11 | 0.1% |
| ca-2016-165330 | 11 | 0.1% |
| us-2015-126977 | 10 | 0.1% |
| ca-2016-105732 | 10 | 0.1% |
| ca-2015-131338 | 10 | 0.1% |
| ca-2014-106439 | 9 | 0.1% |
| ca-2015-132626 | 9 | 0.1% |
| ca-2015-158421 | 9 | 0.1% |
| Other values (4999) | 9881 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 25486 | |
| - | 19972 | |
| 0 | 15478 | |
| 2 | 15369 | |
| C | 8302 | 5.9% |
| A | 8302 | 5.9% |
| 6 | 7900 | 5.7% |
| 7 | 7431 | 5.3% |
| 4 | 7396 | 5.3% |
| 5 | 7332 | 5.2% |
| Other values (5) | 16836 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 99860 | |
| Dash Punctuation | 19972 | 14.3% |
| Uppercase Letter | 19972 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 25486 | |
| 0 | 15478 | |
| 2 | 15369 | |
| 6 | 7900 | 7.9% |
| 7 | 7431 | 7.4% |
| 4 | 7396 | 7.4% |
| 5 | 7332 | 7.3% |
| 3 | 5444 | 5.5% |
| 8 | 4041 | 4.0% |
| 9 | 3983 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 8302 | |
| A | 8302 | |
| U | 1684 | 8.4% |
| S | 1684 | 8.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19972 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 119832 | |
| Latin | 19972 | 14.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 25486 | |
| - | 19972 | |
| 0 | 15478 | |
| 2 | 15369 | |
| 6 | 7900 | 6.6% |
| 7 | 7431 | 6.2% |
| 4 | 7396 | 6.2% |
| 5 | 7332 | 6.1% |
| 3 | 5444 | 4.5% |
| 8 | 4041 | 3.4% |
Latin
| Value | Count | Frequency (%) |
| C | 8302 | |
| A | 8302 | |
| U | 1684 | 8.4% |
| S | 1684 | 8.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 139804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 25486 | |
| - | 19972 | |
| 0 | 15478 | |
| 2 | 15369 | |
| C | 8302 | 5.9% |
| A | 8302 | 5.9% |
| 6 | 7900 | 5.7% |
| 7 | 7431 | 5.3% |
| 4 | 7396 | 5.3% |
| 5 | 7332 | 5.2% |
| Other values (5) | 16836 |
Product ID
Text
| Distinct | 1862 |
|---|---|
| Distinct (%) | 18.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Unique
| Unique | 91 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | TEC-PH-10002075 |
|---|---|
| 2nd row | FUR-TA-10003715 |
| 3rd row | OFF-BI-10001597 |
| 4th row | OFF-PA-10000176 |
| 5th row | OFF-BI-10000343 |
| Value | Count | Frequency (%) |
| tec-ac-10003832 | 18 | 0.2% |
| off-pa-10001970 | 18 | 0.2% |
| fur-fu-10004270 | 16 | 0.2% |
| fur-ch-10002647 | 15 | 0.2% |
| tec-ac-10003628 | 15 | 0.2% |
| fur-ch-10001146 | 15 | 0.2% |
| tec-ac-10002049 | 15 | 0.2% |
| off-pa-10002377 | 14 | 0.1% |
| fur-ch-10003774 | 14 | 0.1% |
| off-bi-10001524 | 14 | 0.1% |
| Other values (1852) | 9832 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 35022 | |
| - | 19972 | |
| F | 15336 | |
| 1 | 14985 | |
| O | 6318 | 4.2% |
| 2 | 4859 | 3.2% |
| 4 | 4828 | 3.2% |
| 3 | 4803 | 3.2% |
| A | 4418 | 2.9% |
| 5 | 3398 | 2.3% |
| Other values (17) | 35851 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 79888 | |
| Uppercase Letter | 49930 | |
| Dash Punctuation | 19972 | 13.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 15336 | |
| O | 6318 | |
| A | 4418 | 8.8% |
| C | 3302 | 6.6% |
| U | 3265 | 6.5% |
| T | 3009 | 6.0% |
| R | 2915 | 5.8% |
| P | 2723 | 5.5% |
| E | 2099 | 4.2% |
| B | 1750 | 3.5% |
| Other values (6) | 4795 | 9.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 35022 | |
| 1 | 14985 | |
| 2 | 4859 | 6.1% |
| 4 | 4828 | 6.0% |
| 3 | 4803 | 6.0% |
| 5 | 3398 | 4.3% |
| 7 | 3102 | 3.9% |
| 9 | 3046 | 3.8% |
| 6 | 2993 | 3.7% |
| 8 | 2852 | 3.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19972 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 99860 | |
| Latin | 49930 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 15336 | |
| O | 6318 | |
| A | 4418 | 8.8% |
| C | 3302 | 6.6% |
| U | 3265 | 6.5% |
| T | 3009 | 6.0% |
| R | 2915 | 5.8% |
| P | 2723 | 5.5% |
| E | 2099 | 4.2% |
| B | 1750 | 3.5% |
| Other values (6) | 4795 | 9.6% |
Common
| Value | Count | Frequency (%) |
| 0 | 35022 | |
| - | 19972 | |
| 1 | 14985 | |
| 2 | 4859 | 4.9% |
| 4 | 4828 | 4.8% |
| 3 | 4803 | 4.8% |
| 5 | 3398 | 3.4% |
| 7 | 3102 | 3.1% |
| 9 | 3046 | 3.1% |
| 6 | 2993 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 149790 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 35022 | |
| - | 19972 | |
| F | 15336 | |
| 1 | 14985 | |
| O | 6318 | 4.2% |
| 2 | 4859 | 3.2% |
| 4 | 4828 | 3.2% |
| 3 | 4803 | 3.2% |
| A | 4418 | 2.9% |
| 5 | 3398 | 2.3% |
| Other values (17) | 35851 |
Order Date
Date
| Distinct | 1237 |
|---|---|
| Distinct (%) | 12.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Minimum | 2014-01-03 00:00:00 |
|---|---|
| Maximum | 2017-12-30 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Ship Date
Date
| Distinct | 1334 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Minimum | 2014-01-07 00:00:00 |
|---|---|
| Maximum | 2018-01-05 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Ship Mode
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| standard class | |
|---|---|
| second class | |
| first class | |
| same day | 543 |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 12.823052 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | standard class |
|---|---|
| 2nd row | standard class |
| 3rd row | standard class |
| 4th row | standard class |
| 5th row | standard class |
Common Values
| Value | Count | Frequency (%) |
| standard class | 5964 | |
| second class | 1942 | 19.4% |
| first class | 1537 | 15.4% |
| same day | 543 | 5.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| class | 9443 | |
| standard | 5964 | |
| second | 1942 | 9.7% |
| first | 1537 | 7.7% |
| same | 543 | 2.7% |
| day | 543 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 28872 | |
| a | 22457 | |
| d | 14413 | |
| c | 11385 | 8.9% |
| 9986 | 7.8% | |
| l | 9443 | 7.4% |
| n | 7906 | 6.2% |
| t | 7501 | 5.9% |
| r | 7501 | 5.9% |
| e | 2485 | 1.9% |
| Other values (5) | 6102 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 118065 | |
| Space Separator | 9986 | 7.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 28872 | |
| a | 22457 | |
| d | 14413 | |
| c | 11385 | 9.6% |
| l | 9443 | 8.0% |
| n | 7906 | 6.7% |
| t | 7501 | 6.4% |
| r | 7501 | 6.4% |
| e | 2485 | 2.1% |
| o | 1942 | 1.6% |
| Other values (4) | 4160 | 3.5% |
Space Separator
| Value | Count | Frequency (%) |
| 9986 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 118065 | |
| Common | 9986 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 28872 | |
| a | 22457 | |
| d | 14413 | |
| c | 11385 | 9.6% |
| l | 9443 | 8.0% |
| n | 7906 | 6.7% |
| t | 7501 | 6.4% |
| r | 7501 | 6.4% |
| e | 2485 | 2.1% |
| o | 1942 | 1.6% |
| Other values (4) | 4160 | 3.5% |
Common
| Value | Count | Frequency (%) |
| 9986 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128051 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 28872 | |
| a | 22457 | |
| d | 14413 | |
| c | 11385 | 8.9% |
| 9986 | 7.8% | |
| l | 9443 | 7.4% |
| n | 7906 | 6.2% |
| t | 7501 | 5.9% |
| r | 7501 | 5.9% |
| e | 2485 | 1.9% |
| Other values (5) | 6102 | 4.8% |
Customer ID
Text
| Distinct | 793 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | dk13375 |
|---|---|
| 2nd row | eb13705 |
| 3rd row | eb13705 |
| 4th row | nf18475 |
| 5th row | jc15340 |
| Value | Count | Frequency (%) |
| wb21850 | 37 | 0.4% |
| pp18955 | 34 | 0.3% |
| ma17560 | 34 | 0.3% |
| jl15835 | 34 | 0.3% |
| jd15895 | 32 | 0.3% |
| eh13765 | 32 | 0.3% |
| sv20365 | 32 | 0.3% |
| ck12205 | 32 | 0.3% |
| ap10915 | 31 | 0.3% |
| ep13915 | 31 | 0.3% |
| Other values (783) | 9657 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 11905 | |
| 0 | 8524 | |
| 5 | 7859 | 11.2% |
| 2 | 4679 | 6.7% |
| 7 | 2927 | 4.2% |
| 6 | 2905 | 4.2% |
| 9 | 2901 | 4.2% |
| 8 | 2817 | 4.0% |
| 3 | 2779 | 4.0% |
| 4 | 2634 | 3.8% |
| Other values (26) | 19972 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 49930 | |
| Lowercase Letter | 19972 | 28.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1795 | 9.0% |
| c | 1723 | 8.6% |
| m | 1711 | 8.6% |
| b | 1638 | 8.2% |
| d | 1296 | 6.5% |
| a | 1226 | 6.1% |
| p | 1134 | 5.7% |
| j | 1133 | 5.7% |
| h | 968 | 4.8% |
| k | 932 | 4.7% |
| Other values (16) | 6416 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 11905 | |
| 0 | 8524 | |
| 5 | 7859 | |
| 2 | 4679 | 9.4% |
| 7 | 2927 | 5.9% |
| 6 | 2905 | 5.8% |
| 9 | 2901 | 5.8% |
| 8 | 2817 | 5.6% |
| 3 | 2779 | 5.6% |
| 4 | 2634 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49930 | |
| Latin | 19972 | 28.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1795 | 9.0% |
| c | 1723 | 8.6% |
| m | 1711 | 8.6% |
| b | 1638 | 8.2% |
| d | 1296 | 6.5% |
| a | 1226 | 6.1% |
| p | 1134 | 5.7% |
| j | 1133 | 5.7% |
| h | 968 | 4.8% |
| k | 932 | 4.7% |
| Other values (16) | 6416 |
Common
| Value | Count | Frequency (%) |
| 1 | 11905 | |
| 0 | 8524 | |
| 5 | 7859 | |
| 2 | 4679 | 9.4% |
| 7 | 2927 | 5.9% |
| 6 | 2905 | 5.8% |
| 9 | 2901 | 5.8% |
| 8 | 2817 | 5.6% |
| 3 | 2779 | 5.6% |
| 4 | 2634 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69902 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 11905 | |
| 0 | 8524 | |
| 5 | 7859 | 11.2% |
| 2 | 4679 | 6.7% |
| 7 | 2927 | 4.2% |
| 6 | 2905 | 4.2% |
| 9 | 2901 | 4.2% |
| 8 | 2817 | 4.0% |
| 3 | 2779 | 4.0% |
| 4 | 2634 | 3.8% |
| Other values (26) | 19972 |
Customer Name
Text
| Distinct | 793 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 18 |
| Mean length | 12.945524 |
| Min length | 7 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | dennis kane |
|---|---|
| 2nd row | ed braxton |
| 3rd row | ed braxton |
| 4th row | neil französisch |
| 5th row | jasper cacioppo |
| Value | Count | Frequency (%) |
| michael | 120 | 0.6% |
| frank | 112 | 0.6% |
| john | 107 | 0.5% |
| patrick | 96 | 0.5% |
| stewart | 93 | 0.5% |
| paul | 92 | 0.5% |
| brian | 92 | 0.5% |
| ken | 91 | 0.5% |
| rick | 91 | 0.5% |
| matt | 86 | 0.4% |
| Other values (901) | 19057 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 13280 | 10.3% |
| e | 12451 | 9.6% |
| n | 10787 | 8.3% |
| r | 10394 | 8.0% |
| 10051 | 7.8% | |
| i | 7992 | 6.2% |
| l | 7331 | 5.7% |
| s | 6336 | 4.9% |
| t | 6256 | 4.8% |
| o | 6077 | 4.7% |
| Other values (20) | 38319 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 119223 | |
| Space Separator | 10051 | 7.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13280 | 11.1% |
| e | 12451 | 10.4% |
| n | 10787 | 9.0% |
| r | 10394 | 8.7% |
| i | 7992 | 6.7% |
| l | 7331 | 6.1% |
| s | 6336 | 5.3% |
| t | 6256 | 5.2% |
| o | 6077 | 5.1% |
| h | 4860 | 4.1% |
| Other values (19) | 33459 |
Space Separator
| Value | Count | Frequency (%) |
| 10051 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 119223 | |
| Common | 10051 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 13280 | 11.1% |
| e | 12451 | 10.4% |
| n | 10787 | 9.0% |
| r | 10394 | 8.7% |
| i | 7992 | 6.7% |
| l | 7331 | 6.1% |
| s | 6336 | 5.3% |
| t | 6256 | 5.2% |
| o | 6077 | 5.1% |
| h | 4860 | 4.1% |
| Other values (19) | 33459 |
Common
| Value | Count | Frequency (%) |
| 10051 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 129185 | |
| None | 89 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 13280 | 10.3% |
| e | 12451 | 9.6% |
| n | 10787 | 8.4% |
| r | 10394 | 8.0% |
| 10051 | 7.8% | |
| i | 7992 | 6.2% |
| l | 7331 | 5.7% |
| s | 6336 | 4.9% |
| t | 6256 | 4.8% |
| o | 6077 | 4.7% |
| Other values (17) | 38230 |
None
| Value | Count | Frequency (%) |
| ö | 61 | |
| ä | 23 | 25.8% |
| ü | 5 | 5.6% |
Segment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| consumer | |
|---|---|
| corporate | |
| home office |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.8364711 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | consumer |
|---|---|
| 2nd row | corporate |
| 3rd row | corporate |
| 4th row | home office |
| 5th row | consumer |
Common Values
| Value | Count | Frequency (%) |
| consumer | 5189 | |
| corporate | 3019 | |
| home office | 1778 | 17.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| consumer | 5189 | |
| corporate | 3019 | |
| home | 1778 | 15.1% |
| office | 1778 | 15.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 14783 | |
| e | 11764 | |
| r | 11227 | |
| c | 9986 | |
| m | 6967 | |
| n | 5189 | 5.9% |
| s | 5189 | 5.9% |
| u | 5189 | 5.9% |
| f | 3556 | 4.0% |
| p | 3019 | 3.4% |
| Other values (5) | 11372 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 86463 | |
| Space Separator | 1778 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 14783 | |
| e | 11764 | |
| r | 11227 | |
| c | 9986 | |
| m | 6967 | |
| n | 5189 | 6.0% |
| s | 5189 | 6.0% |
| u | 5189 | 6.0% |
| f | 3556 | 4.1% |
| p | 3019 | 3.5% |
| Other values (4) | 9594 |
Space Separator
| Value | Count | Frequency (%) |
| 1778 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 86463 | |
| Common | 1778 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 14783 | |
| e | 11764 | |
| r | 11227 | |
| c | 9986 | |
| m | 6967 | |
| n | 5189 | 6.0% |
| s | 5189 | 6.0% |
| u | 5189 | 6.0% |
| f | 3556 | 4.1% |
| p | 3019 | 3.5% |
| Other values (4) | 9594 |
Common
| Value | Count | Frequency (%) |
| 1778 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 14783 | |
| e | 11764 | |
| r | 11227 | |
| c | 9986 | |
| m | 6967 | |
| n | 5189 | 5.9% |
| s | 5189 | 5.9% |
| u | 5189 | 5.9% |
| f | 3556 | 4.0% |
| p | 3019 | 3.4% |
| Other values (5) | 11372 |
Country
Categorical
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| united states |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | united states |
|---|---|
| 2nd row | united states |
| 3rd row | united states |
| 4th row | united states |
| 5th row | united states |
Common Values
| Value | Count | Frequency (%) |
| united states | 9986 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| united | 9986 | |
| states | 9986 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 29958 | |
| e | 19972 | |
| s | 19972 | |
| u | 9986 | 7.7% |
| n | 9986 | 7.7% |
| i | 9986 | 7.7% |
| d | 9986 | 7.7% |
| 9986 | 7.7% | |
| a | 9986 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 119832 | |
| Space Separator | 9986 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 29958 | |
| e | 19972 | |
| s | 19972 | |
| u | 9986 | 8.3% |
| n | 9986 | 8.3% |
| i | 9986 | 8.3% |
| d | 9986 | 8.3% |
| a | 9986 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| 9986 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 119832 | |
| Common | 9986 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 29958 | |
| e | 19972 | |
| s | 19972 | |
| u | 9986 | 8.3% |
| n | 9986 | 8.3% |
| i | 9986 | 8.3% |
| d | 9986 | 8.3% |
| a | 9986 | 8.3% |
Common
| Value | Count | Frequency (%) |
| 9986 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 129818 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 29958 | |
| e | 19972 | |
| s | 19972 | |
| u | 9986 | 7.7% |
| n | 9986 | 7.7% |
| i | 9986 | 7.7% |
| d | 9986 | 7.7% |
| 9986 | 7.7% | |
| a | 9986 | 7.7% |
City
Text
| Distinct | 531 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 9.3308632 |
| Min length | 4 |
Unique
| Unique | 70 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | new york city |
|---|---|
| 2nd row | san francisco |
| 3rd row | san francisco |
| 4th row | jacksonville |
| 5th row | new york city |
| Value | Count | Frequency (%) |
| city | 993 | 7.0% |
| new | 936 | 6.6% |
| york | 919 | 6.5% |
| san | 805 | 5.7% |
| los | 747 | 5.3% |
| angeles | 747 | 5.3% |
| philadelphia | 537 | 3.8% |
| francisco | 510 | 3.6% |
| seattle | 428 | 3.0% |
| houston | 377 | 2.7% |
| Other values (555) | 7227 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8829 | 9.5% |
| e | 8828 | 9.5% |
| o | 7679 | 8.2% |
| n | 7327 | 7.9% |
| l | 7275 | 7.8% |
| s | 6434 | 6.9% |
| i | 6273 | 6.7% |
| r | 4850 | 5.2% |
| t | 4691 | 5.0% |
| c | 4474 | 4.8% |
| Other values (17) | 26518 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 88938 | |
| Space Separator | 4240 | 4.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8829 | 9.9% |
| e | 8828 | 9.9% |
| o | 7679 | 8.6% |
| n | 7327 | 8.2% |
| l | 7275 | 8.2% |
| s | 6434 | 7.2% |
| i | 6273 | 7.1% |
| r | 4850 | 5.5% |
| t | 4691 | 5.3% |
| c | 4474 | 5.0% |
| Other values (16) | 22278 |
Space Separator
| Value | Count | Frequency (%) |
| 4240 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 88938 | |
| Common | 4240 | 4.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8829 | 9.9% |
| e | 8828 | 9.9% |
| o | 7679 | 8.6% |
| n | 7327 | 8.2% |
| l | 7275 | 8.2% |
| s | 6434 | 7.2% |
| i | 6273 | 7.1% |
| r | 4850 | 5.5% |
| t | 4691 | 5.3% |
| c | 4474 | 5.0% |
| Other values (16) | 22278 |
Common
| Value | Count | Frequency (%) |
| 4240 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 93178 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8829 | 9.5% |
| e | 8828 | 9.5% |
| o | 7679 | 8.2% |
| n | 7327 | 7.9% |
| l | 7275 | 7.8% |
| s | 6434 | 6.9% |
| i | 6273 | 6.7% |
| r | 4850 | 5.2% |
| t | 4691 | 5.0% |
| c | 4474 | 4.8% |
| Other values (17) | 26518 |
State
Categorical
High correlation 
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| california | |
|---|---|
| new york | |
| texas | |
| pennsylvania | |
| washington | |
| Other values (44) |
Length
| Max length | 20 |
|---|---|
| Median length | 14 |
| Mean length | 8.4870819 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | new york |
|---|---|
| 2nd row | california |
| 3rd row | california |
| 4th row | florida |
| 5th row | new york |
Common Values
| Value | Count | Frequency (%) |
| california | 2001 | |
| new york | 1127 | 11.3% |
| texas | 985 | 9.9% |
| pennsylvania | 587 | 5.9% |
| washington | 506 | 5.1% |
| illinois | 492 | 4.9% |
| ohio | 468 | 4.7% |
| florida | 383 | 3.8% |
| michigan | 255 | 2.6% |
| north carolina | 248 | 2.5% |
| Other values (39) | 2934 |
Length
| Value | Count | Frequency (%) |
| california | 2001 | |
| new | 1321 | 11.3% |
| york | 1127 | 9.6% |
| texas | 985 | 8.4% |
| pennsylvania | 587 | 5.0% |
| washington | 506 | 4.3% |
| illinois | 492 | 4.2% |
| ohio | 468 | 4.0% |
| florida | 383 | 3.3% |
| carolina | 290 | 2.5% |
| Other values (43) | 3536 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11097 | |
| i | 10634 | |
| n | 9739 | |
| o | 7974 | |
| r | 5594 | 6.6% |
| e | 5049 | 6.0% |
| l | 4861 | 5.7% |
| s | 4654 | 5.5% |
| c | 3413 | 4.0% |
| t | 2766 | 3.3% |
| Other values (16) | 18971 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 83042 | |
| Space Separator | 1710 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11097 | |
| i | 10634 | |
| n | 9739 | |
| o | 7974 | |
| r | 5594 | 6.7% |
| e | 5049 | 6.1% |
| l | 4861 | 5.9% |
| s | 4654 | 5.6% |
| c | 3413 | 4.1% |
| t | 2766 | 3.3% |
| Other values (15) | 17261 |
Space Separator
| Value | Count | Frequency (%) |
| 1710 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83042 | |
| Common | 1710 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11097 | |
| i | 10634 | |
| n | 9739 | |
| o | 7974 | |
| r | 5594 | 6.7% |
| e | 5049 | 6.1% |
| l | 4861 | 5.9% |
| s | 4654 | 5.6% |
| c | 3413 | 4.1% |
| t | 2766 | 3.3% |
| Other values (15) | 17261 |
Common
| Value | Count | Frequency (%) |
| 1710 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 84752 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11097 | |
| i | 10634 | |
| n | 9739 | |
| o | 7974 | |
| r | 5594 | 6.6% |
| e | 5049 | 6.0% |
| l | 4861 | 5.7% |
| s | 4654 | 5.5% |
| c | 3413 | 4.0% |
| t | 2766 | 3.3% |
| Other values (16) | 18971 |
Postal Code
Real number (ℝ)
High correlation 
| Distinct | 631 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55206.14 |
| Minimum | 1040 |
|---|---|
| Maximum | 99301 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1040 |
|---|---|
| 5-th percentile | 10009 |
| Q1 | 23223 |
| median | 57103 |
| Q3 | 90008 |
| 95-th percentile | 98006 |
| Maximum | 99301 |
| Range | 98261 |
| Interquartile range (IQR) | 66785 |
Descriptive statistics
| Standard deviation | 32066.719 |
|---|---|
| Coefficient of variation (CV) | 0.58085421 |
| Kurtosis | -1.4928665 |
| Mean | 55206.14 |
| Median Absolute Deviation (MAD) | 32929 |
| Skewness | -0.12951533 |
| Sum | 5.5128851 × 108 |
| Variance | 1.0282744 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10035 | 263 | 2.6% |
| 10024 | 230 | 2.3% |
| 10009 | 228 | 2.3% |
| 94122 | 203 | 2.0% |
| 10011 | 193 | 1.9% |
| 94110 | 166 | 1.7% |
| 98105 | 165 | 1.7% |
| 19134 | 160 | 1.6% |
| 90049 | 151 | 1.5% |
| 98103 | 151 | 1.5% |
| Other values (621) | 8076 |
| Value | Count | Frequency (%) |
| 1040 | 1 | < 0.1% |
| 1453 | 6 | 0.1% |
| 1752 | 2 | < 0.1% |
| 1810 | 4 | < 0.1% |
| 1841 | 33 | |
| 1852 | 16 | |
| 1915 | 3 | < 0.1% |
| 2038 | 17 | |
| 2138 | 6 | 0.1% |
| 2148 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 99301 | 6 | 0.1% |
| 99207 | 7 | 0.1% |
| 98661 | 5 | 0.1% |
| 98632 | 3 | < 0.1% |
| 98502 | 5 | 0.1% |
| 98270 | 2 | < 0.1% |
| 98226 | 3 | < 0.1% |
| 98208 | 1 | < 0.1% |
| 98198 | 7 | 0.1% |
| 98115 | 112 |
Region
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| west | |
|---|---|
| east | |
| central | |
| south |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.8597036 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | east |
|---|---|
| 2nd row | west |
| 3rd row | west |
| 4th row | south |
| 5th row | east |
Common Values
| Value | Count | Frequency (%) |
| west | 3202 | |
| east | 2845 | |
| central | 2323 | |
| south | 1616 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| west | 3202 | |
| east | 2845 | |
| central | 2323 | |
| south | 1616 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 9986 | |
| e | 8370 | |
| s | 7663 | |
| a | 5168 | |
| w | 3202 | 6.6% |
| c | 2323 | 4.8% |
| n | 2323 | 4.8% |
| r | 2323 | 4.8% |
| l | 2323 | 4.8% |
| o | 1616 | 3.3% |
| Other values (2) | 3232 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48529 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 9986 | |
| e | 8370 | |
| s | 7663 | |
| a | 5168 | |
| w | 3202 | 6.6% |
| c | 2323 | 4.8% |
| n | 2323 | 4.8% |
| r | 2323 | 4.8% |
| l | 2323 | 4.8% |
| o | 1616 | 3.3% |
| Other values (2) | 3232 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48529 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 9986 | |
| e | 8370 | |
| s | 7663 | |
| a | 5168 | |
| w | 3202 | 6.6% |
| c | 2323 | 4.8% |
| n | 2323 | 4.8% |
| r | 2323 | 4.8% |
| l | 2323 | 4.8% |
| o | 1616 | 3.3% |
| Other values (2) | 3232 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48529 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 9986 | |
| e | 8370 | |
| s | 7663 | |
| a | 5168 | |
| w | 3202 | 6.6% |
| c | 2323 | 4.8% |
| n | 2323 | 4.8% |
| r | 2323 | 4.8% |
| l | 2323 | 4.8% |
| o | 1616 | 3.3% |
| Other values (2) | 3232 | 6.7% |
Category
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| office supplies | |
|---|---|
| furniture | |
| technology |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 12.803024 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | technology |
|---|---|
| 2nd row | furniture |
| 3rd row | office supplies |
| 4th row | office supplies |
| 5th row | office supplies |
Common Values
| Value | Count | Frequency (%) |
| office supplies | 6022 | |
| furniture | 2119 | 21.2% |
| technology | 1845 | 18.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| office | 6022 | |
| supplies | 6022 | |
| furniture | 2119 | 13.2% |
| technology | 1845 | 11.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 16008 | |
| f | 14163 | |
| i | 14163 | |
| s | 12044 | |
| p | 12044 | |
| u | 10260 | |
| o | 9712 | |
| c | 7867 | |
| l | 7867 | |
| 6022 | 4.7% | |
| Other values (6) | 17701 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 121829 | |
| Space Separator | 6022 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 16008 | |
| f | 14163 | |
| i | 14163 | |
| s | 12044 | |
| p | 12044 | |
| u | 10260 | |
| o | 9712 | |
| c | 7867 | |
| l | 7867 | |
| r | 4238 | 3.5% |
| Other values (5) | 13463 |
Space Separator
| Value | Count | Frequency (%) |
| 6022 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 121829 | |
| Common | 6022 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 16008 | |
| f | 14163 | |
| i | 14163 | |
| s | 12044 | |
| p | 12044 | |
| u | 10260 | |
| o | 9712 | |
| c | 7867 | |
| l | 7867 | |
| r | 4238 | 3.5% |
| Other values (5) | 13463 |
Common
| Value | Count | Frequency (%) |
| 6022 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 127851 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 16008 | |
| f | 14163 | |
| i | 14163 | |
| s | 12044 | |
| p | 12044 | |
| u | 10260 | |
| o | 9712 | |
| c | 7867 | |
| l | 7867 | |
| 6022 | 4.7% | |
| Other values (6) | 17701 |
Sub-Category
Categorical
High correlation 
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| binders | |
|---|---|
| paper | |
| furnishings | |
| phones | |
| storage | |
| Other values (12) |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 7.1911676 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | phones |
|---|---|
| 2nd row | tables |
| 3rd row | binders |
| 4th row | paper |
| 5th row | binders |
Common Values
| Value | Count | Frequency (%) |
| binders | 1522 | |
| paper | 1368 | |
| furnishings | 956 | |
| phones | 889 | |
| storage | 845 | |
| art | 796 | |
| accessories | 773 | |
| chairs | 616 | |
| appliances | 466 | 4.7% |
| labels | 364 | 3.6% |
| Other values (7) | 1391 |
Length
| Value | Count | Frequency (%) |
| binders | 1522 | |
| paper | 1368 | |
| furnishings | 956 | |
| phones | 889 | |
| storage | 845 | |
| art | 796 | |
| accessories | 773 | |
| chairs | 616 | |
| appliances | 466 | 4.7% |
| labels | 364 | 3.6% |
| Other values (7) | 1391 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 10959 | |
| e | 9116 | |
| r | 7161 | |
| a | 6573 | |
| i | 5662 | |
| n | 5375 | |
| p | 5259 | |
| o | 3285 | 4.6% |
| c | 3039 | 4.2% |
| h | 2576 | 3.6% |
| Other values (10) | 12806 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 71811 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 10959 | |
| e | 9116 | |
| r | 7161 | |
| a | 6573 | |
| i | 5662 | |
| n | 5375 | |
| p | 5259 | |
| o | 3285 | 4.6% |
| c | 3039 | 4.2% |
| h | 2576 | 3.6% |
| Other values (10) | 12806 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 71811 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 10959 | |
| e | 9116 | |
| r | 7161 | |
| a | 6573 | |
| i | 5662 | |
| n | 5375 | |
| p | 5259 | |
| o | 3285 | 4.6% |
| c | 3039 | 4.2% |
| h | 2576 | 3.6% |
| Other values (10) | 12806 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 71811 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 10959 | |
| e | 9116 | |
| r | 7161 | |
| a | 6573 | |
| i | 5662 | |
| n | 5375 | |
| p | 5259 | |
| o | 3285 | 4.6% |
| c | 3039 | 4.2% |
| h | 2576 | 3.6% |
| Other values (10) | 12806 |
Product Name
Text
| Distinct | 1846 |
|---|---|
| Distinct (%) | 18.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 120 |
|---|---|
| Median length | 74 |
| Mean length | 35.88584 |
| Min length | 5 |
Unique
| Unique | 91 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | att el51110 dect |
|---|---|
| 2nd row | hon 2111 invitation series corner table |
| 3rd row | wilson jones ledgersize pianohinge binder 2 blue |
| 4th row | xerox 1887 |
| 5th row | pressboard covers with storage hooks 9 12 x 11 light blue |
| Value | Count | Frequency (%) |
| xerox | 863 | 1.6% |
| x | 701 | 1.3% |
| with | 598 | 1.1% |
| avery | 557 | 1.0% |
| for | 538 | 1.0% |
| binders | 524 | 0.9% |
| chair | 478 | 0.9% |
| black | 424 | 0.8% |
| phone | 374 | 0.7% |
| gbc | 341 | 0.6% |
| Other values (2753) | 49989 |
Most occurring characters
| Value | Count | Frequency (%) |
| 45616 | 12.7% | |
| e | 35563 | 9.9% |
| r | 22802 | 6.4% |
| a | 21990 | 6.1% |
| o | 21286 | 5.9% |
| s | 20954 | 5.8% |
| i | 19940 | 5.6% |
| l | 18633 | 5.2% |
| t | 17147 | 4.8% |
| n | 16406 | 4.6% |
| Other values (33) | 118019 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 294261 | |
| Space Separator | 46041 | 12.8% |
| Decimal Number | 17963 | 5.0% |
| Control | 86 | < 0.1% |
| Other Number | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 35563 | |
| r | 22802 | 7.7% |
| a | 21990 | 7.5% |
| o | 21286 | 7.2% |
| s | 20954 | 7.1% |
| i | 19940 | 6.8% |
| l | 18633 | 6.3% |
| t | 17147 | 5.8% |
| n | 16406 | 5.6% |
| c | 14915 | 5.1% |
| Other values (18) | 84625 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3777 | |
| 0 | 2920 | |
| 2 | 2268 | |
| 4 | 1723 | |
| 3 | 1530 | |
| 5 | 1443 | 8.0% |
| 8 | 1252 | 7.0% |
| 9 | 1232 | 6.9% |
| 6 | 938 | 5.2% |
| 7 | 880 | 4.9% |
Space Separator
| Value | Count | Frequency (%) |
| 45616 | ||
| 425 | 0.9% |
Control
| Value | Count | Frequency (%) |
| | 67 | |
| | 19 | 22.1% |
Other Number
| Value | Count | Frequency (%) |
| ¾ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 294261 | |
| Common | 64095 | 17.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 35563 | |
| r | 22802 | 7.7% |
| a | 21990 | 7.5% |
| o | 21286 | 7.2% |
| s | 20954 | 7.1% |
| i | 19940 | 6.8% |
| l | 18633 | 6.3% |
| t | 17147 | 5.8% |
| n | 16406 | 5.6% |
| c | 14915 | 5.1% |
| Other values (18) | 84625 |
Common
| Value | Count | Frequency (%) |
| 45616 | ||
| 1 | 3777 | 5.9% |
| 0 | 2920 | 4.6% |
| 2 | 2268 | 3.5% |
| 4 | 1723 | 2.7% |
| 3 | 1530 | 2.4% |
| 5 | 1443 | 2.3% |
| 8 | 1252 | 2.0% |
| 9 | 1232 | 1.9% |
| 6 | 938 | 1.5% |
| Other values (5) | 1396 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 357823 | |
| None | 533 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 45616 | 12.7% | |
| e | 35563 | 9.9% |
| r | 22802 | 6.4% |
| a | 21990 | 6.1% |
| o | 21286 | 5.9% |
| s | 20954 | 5.9% |
| i | 19940 | 5.6% |
| l | 18633 | 5.2% |
| t | 17147 | 4.8% |
| n | 16406 | 4.6% |
| Other values (27) | 117486 |
None
| Value | Count | Frequency (%) |
| 425 | ||
| | 67 | 12.6% |
| | 19 | 3.6% |
| é | 14 | 2.6% |
| ¾ | 5 | 0.9% |
| à | 3 | 0.6% |
Sales
Real number (ℝ)
High correlation 
| Distinct | 5826 |
|---|---|
| Distinct (%) | 58.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 230.04215 |
| Minimum | 0.444 |
|---|---|
| Maximum | 22638.48 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0.444 |
|---|---|
| 5-th percentile | 4.98 |
| Q1 | 17.248 |
| median | 54.432 |
| Q3 | 209.9375 |
| 95-th percentile | 957.34933 |
| Maximum | 22638.48 |
| Range | 22638.036 |
| Interquartile range (IQR) | 192.6895 |
Descriptive statistics
| Standard deviation | 623.66752 |
|---|---|
| Coefficient of variation (CV) | 2.7111011 |
| Kurtosis | 304.71558 |
| Mean | 230.04215 |
| Median Absolute Deviation (MAD) | 45.42 |
| Skewness | 12.957949 |
| Sum | 2297200.9 |
| Variance | 388961.17 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.96 | 56 | 0.6% |
| 19.44 | 39 | 0.4% |
| 15.552 | 39 | 0.4% |
| 10.368 | 36 | 0.4% |
| 25.92 | 36 | 0.4% |
| 32.4 | 28 | 0.3% |
| 6.48 | 21 | 0.2% |
| 17.94 | 21 | 0.2% |
| 20.736 | 19 | 0.2% |
| 14.94 | 17 | 0.2% |
| Other values (5816) | 9674 |
| Value | Count | Frequency (%) |
| 0.444 | 1 | < 0.1% |
| 0.556 | 1 | < 0.1% |
| 0.836 | 1 | < 0.1% |
| 0.852 | 1 | < 0.1% |
| 0.876 | 1 | < 0.1% |
| 0.898 | 1 | < 0.1% |
| 0.984 | 1 | < 0.1% |
| 0.99 | 1 | < 0.1% |
| 1.044 | 1 | < 0.1% |
| 1.08 | 3 |
| Value | Count | Frequency (%) |
| 22638.48 | 1 | |
| 17499.95 | 1 | |
| 13999.96 | 1 | |
| 11199.968 | 1 | |
| 10499.97 | 1 | |
| 9892.74 | 1 | |
| 9449.95 | 1 | |
| 9099.93 | 1 | |
| 8749.95 | 1 | |
| 8399.976 | 1 |
Quantity
Real number (ℝ)
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7926097 |
| Minimum | 1 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 16 |
| Range | 15 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.2333901 |
|---|---|
| Coefficient of variation (CV) | 0.58887952 |
| Kurtosis | 2.0998452 |
| Mean | 3.7926097 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.2980663 |
| Sum | 37873 |
| Variance | 4.9880315 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2407 | |
| 2 | 2398 | |
| 5 | 1229 | |
| 4 | 1191 | |
| 1 | 899 | 9.0% |
| 7 | 605 | 6.1% |
| 6 | 570 | 5.7% |
| 9 | 257 | 2.6% |
| 8 | 256 | 2.6% |
| 10 | 57 | 0.6% |
| Other values (6) | 117 | 1.2% |
| Value | Count | Frequency (%) |
| 1 | 899 | 9.0% |
| 2 | 2398 | |
| 3 | 2407 | |
| 4 | 1191 | |
| 5 | 1229 | |
| 6 | 570 | 5.7% |
| 7 | 605 | 6.1% |
| 8 | 256 | 2.6% |
| 9 | 257 | 2.6% |
| 10 | 57 | 0.6% |
| Value | Count | Frequency (%) |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 29 | 0.3% |
| 13 | 27 | 0.3% |
| 12 | 25 | 0.3% |
| 11 | 34 | 0.3% |
| 10 | 57 | 0.6% |
| 9 | 257 | |
| 8 | 256 | |
| 7 | 605 |
Discount
Real number (ℝ)
High correlation  Zeros 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.15625776 |
| Minimum | 0 |
|---|---|
| Maximum | 0.8 |
| Zeros | 4793 |
| Zeros (%) | 48.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.2 |
| Q3 | 0.2 |
| 95-th percentile | 0.7 |
| Maximum | 0.8 |
| Range | 0.8 |
| Interquartile range (IQR) | 0.2 |
Descriptive statistics
| Standard deviation | 0.20649912 |
|---|---|
| Coefficient of variation (CV) | 1.3215288 |
| Kurtosis | 2.4069541 |
| Mean | 0.15625776 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 1.6838712 |
| Sum | 1560.39 |
| Variance | 0.042641888 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4793 | |
| 0.2 | 3655 | |
| 0.7 | 418 | 4.2% |
| 0.8 | 300 | 3.0% |
| 0.3 | 226 | 2.3% |
| 0.4 | 206 | 2.1% |
| 0.6 | 138 | 1.4% |
| 0.1 | 94 | 0.9% |
| 0.5 | 66 | 0.7% |
| 0.15 | 52 | 0.5% |
| Other values (2) | 38 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 4793 | |
| 0.1 | 94 | 0.9% |
| 0.15 | 52 | 0.5% |
| 0.2 | 3655 | |
| 0.3 | 226 | 2.3% |
| 0.32 | 27 | 0.3% |
| 0.4 | 206 | 2.1% |
| 0.45 | 11 | 0.1% |
| 0.5 | 66 | 0.7% |
| 0.6 | 138 | 1.4% |
| Value | Count | Frequency (%) |
| 0.8 | 300 | 3.0% |
| 0.7 | 418 | 4.2% |
| 0.6 | 138 | 1.4% |
| 0.5 | 66 | 0.7% |
| 0.45 | 11 | 0.1% |
| 0.4 | 206 | 2.1% |
| 0.32 | 27 | 0.3% |
| 0.3 | 226 | 2.3% |
| 0.2 | 3655 | |
| 0.15 | 52 | 0.5% |
Profit
Real number (ℝ)
High correlation 
| Distinct | 7284 |
|---|---|
| Distinct (%) | 72.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.679854 |
| Minimum | -6599.978 |
|---|---|
| Maximum | 8399.976 |
| Zeros | 65 |
| Zeros (%) | 0.7% |
| Negative | 1870 |
| Negative (%) | 18.7% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -6599.978 |
|---|---|
| 5-th percentile | -53.0562 |
| Q1 | 1.728 |
| median | 8.64135 |
| Q3 | 29.3625 |
| 95-th percentile | 169.0041 |
| Maximum | 8399.976 |
| Range | 14999.954 |
| Interquartile range (IQR) | 27.6345 |
Descriptive statistics
| Standard deviation | 234.39483 |
|---|---|
| Coefficient of variation (CV) | 8.172804 |
| Kurtosis | 396.58955 |
| Mean | 28.679854 |
| Median Absolute Deviation (MAD) | 10.78535 |
| Skewness | 7.5552389 |
| Sum | 286397.02 |
| Variance | 54940.934 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 65 | 0.7% |
| 6.2208 | 43 | 0.4% |
| 9.3312 | 38 | 0.4% |
| 3.6288 | 32 | 0.3% |
| 5.4432 | 32 | 0.3% |
| 15.552 | 26 | 0.3% |
| 12.4416 | 21 | 0.2% |
| 7.2576 | 19 | 0.2% |
| 3.1104 | 18 | 0.2% |
| 9.072 | 11 | 0.1% |
| Other values (7274) | 9681 |
| Value | Count | Frequency (%) |
| -6599.978 | 1 | |
| -3839.9904 | 1 | |
| -3701.8928 | 1 | |
| -3399.98 | 1 | |
| -2929.4845 | 1 | |
| -2639.9912 | 1 | |
| -2287.782 | 1 | |
| -1862.3124 | 1 | |
| -1850.9464 | 1 | |
| -1811.0784 | 1 |
| Value | Count | Frequency (%) |
| 8399.976 | 1 | |
| 6719.9808 | 1 | |
| 5039.9856 | 1 | |
| 4946.37 | 1 | |
| 4630.4755 | 1 | |
| 3919.9888 | 1 | |
| 3177.475 | 1 | |
| 2799.984 | 1 | |
| 2591.9568 | 1 | |
| 2504.2216 | 1 |
Profit Margin
Real number (ℝ)
High correlation 
| Distinct | 572 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.12018479 |
| Minimum | -2.75 |
|---|---|
| Maximum | 0.5 |
| Zeros | 65 |
| Zeros (%) | 0.7% |
| Negative | 1870 |
| Negative (%) | 18.7% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -2.75 |
|---|---|
| 5-th percentile | -0.76666667 |
| Q1 | 0.075 |
| median | 0.27 |
| Q3 | 0.3625 |
| 95-th percentile | 0.48 |
| Maximum | 0.5 |
| Range | 3.25 |
| Interquartile range (IQR) | 0.2875 |
Descriptive statistics
| Standard deviation | 0.46689386 |
|---|---|
| Coefficient of variation (CV) | 3.8848 |
| Kurtosis | 10.164857 |
| Mean | 0.12018479 |
| Median Absolute Deviation (MAD) | 0.17 |
| Skewness | -2.8938446 |
| Sum | 1200.1653 |
| Variance | 0.21798988 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.26 | 258 | 2.6% |
| 0.35 | 246 | 2.5% |
| 0.48 | 204 | 2.0% |
| 0.47 | 201 | 2.0% |
| 0.48 | 189 | 1.9% |
| 0.35 | 185 | 1.9% |
| 0.3625 | 177 | 1.8% |
| 0.325 | 171 | 1.7% |
| 0.29 | 169 | 1.7% |
| 0.125 | 163 | 1.6% |
| Other values (562) | 8023 |
| Value | Count | Frequency (%) |
| -2.75 | 4 | |
| -2.7 | 8 | |
| -2.7 | 6 | |
| -2.65 | 5 | |
| -2.6 | 6 | |
| -2.6 | 4 | |
| -2.55 | 4 | |
| -2.55 | 9 | |
| -2.5 | 9 | |
| -2.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.5 | 140 | |
| 0.49 | 5 | 0.1% |
| 0.49 | 74 | 0.7% |
| 0.49 | 157 | |
| 0.49 | 79 | 0.8% |
| 0.48 | 10 | 0.1% |
| 0.48 | 189 | |
| 0.48 | 204 | |
| 0.48 | 106 | |
| 0.47 | 18 | 0.2% |
Interactions
Correlations
| Category | Discount | Postal Code | Profit | Profit Margin | Quantity | Region | Sales | Segment | Ship Mode | State | Sub-Category | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Category | 1.000 | 0.377 | 0.000 | 0.057 | 0.271 | 0.000 | 0.000 | 0.072 | 0.000 | 0.000 | 0.018 | 0.999 |
| Discount | 0.377 | 1.000 | 0.052 | -0.543 | -0.645 | -0.001 | 0.294 | -0.057 | 0.005 | 0.027 | 0.354 | 0.353 |
| Postal Code | 0.000 | 0.052 | 1.000 | -0.005 | -0.028 | 0.013 | 0.921 | -0.002 | 0.035 | 0.038 | 0.968 | 0.000 |
| Profit | 0.057 | -0.543 | -0.005 | 1.000 | 0.500 | 0.234 | 0.021 | 0.518 | 0.000 | 0.005 | 0.017 | 0.130 |
| Profit Margin | 0.271 | -0.645 | -0.028 | 0.500 | 1.000 | 0.001 | 0.204 | -0.200 | 0.016 | 0.012 | 0.228 | 0.304 |
| Quantity | 0.000 | -0.001 | 0.013 | 0.234 | 0.001 | 1.000 | 0.015 | 0.328 | 0.016 | 0.000 | 0.026 | 0.000 |
| Region | 0.000 | 0.294 | 0.921 | 0.021 | 0.204 | 0.015 | 1.000 | 0.000 | 0.000 | 0.022 | 0.998 | 0.000 |
| Sales | 0.072 | -0.057 | -0.002 | 0.518 | -0.200 | 0.328 | 0.000 | 1.000 | 0.002 | 0.000 | 0.000 | 0.142 |
| Segment | 0.000 | 0.005 | 0.035 | 0.000 | 0.016 | 0.016 | 0.000 | 0.002 | 1.000 | 0.033 | 0.090 | 0.000 |
| Ship Mode | 0.000 | 0.027 | 0.038 | 0.005 | 0.012 | 0.000 | 0.022 | 0.000 | 0.033 | 1.000 | 0.096 | 0.007 |
| State | 0.018 | 0.354 | 0.968 | 0.017 | 0.228 | 0.026 | 0.998 | 0.000 | 0.090 | 0.096 | 1.000 | 0.000 |
| Sub-Category | 0.999 | 0.353 | 0.000 | 0.130 | 0.304 | 0.000 | 0.000 | 0.142 | 0.000 | 0.007 | 0.000 | 1.000 |
Missing values
Sample
| Order ID | Product ID | Order Date | Ship Date | Ship Mode | Customer ID | Customer Name | Segment | Country | City | State | Postal Code | Region | Category | Sub-Category | Product Name | Sales | Quantity | Discount | Profit | Profit Margin | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CA-2014-100006 | TEC-PH-10002075 | 9/7/2014 | 9/13/2014 | standard class | dk13375 | dennis kane | consumer | united states | new york city | new york | 10024 | east | technology | phones | att el51110 dect | 377.970 | 3 | 0.0 | 109.6113 | 0.290000 |
| 1 | CA-2014-100090 | FUR-TA-10003715 | 7/8/2014 | 7/12/2014 | standard class | eb13705 | ed braxton | corporate | united states | san francisco | california | 94122 | west | furniture | tables | hon 2111 invitation series corner table | 502.488 | 3 | 0.2 | -87.9354 | -0.175000 |
| 2 | CA-2014-100090 | OFF-BI-10001597 | 7/8/2014 | 7/12/2014 | standard class | eb13705 | ed braxton | corporate | united states | san francisco | california | 94122 | west | office supplies | binders | wilson jones ledgersize pianohinge binder 2 blue | 196.704 | 6 | 0.2 | 68.8464 | 0.350000 |
| 3 | CA-2014-100293 | OFF-PA-10000176 | 3/14/2014 | 3/18/2014 | standard class | nf18475 | neil französisch | home office | united states | jacksonville | florida | 32216 | south | office supplies | paper | xerox 1887 | 91.056 | 6 | 0.2 | 31.8696 | 0.350000 |
| 4 | CA-2014-100328 | OFF-BI-10000343 | 1/28/2014 | 2/3/2014 | standard class | jc15340 | jasper cacioppo | consumer | united states | new york city | new york | 10024 | east | office supplies | binders | pressboard covers with storage hooks 9 12 x 11 light blue | 3.928 | 1 | 0.2 | 1.3257 | 0.337500 |
| 5 | CA-2014-100363 | OFF-FA-10000611 | 4/8/2014 | 4/15/2014 | standard class | jm15655 | jim mitchum | corporate | united states | glendale | arizona | 85301 | west | office supplies | fasteners | binder clips by oic | 2.368 | 2 | 0.2 | 0.8288 | 0.350000 |
| 6 | CA-2014-100363 | OFF-PA-10004733 | 4/8/2014 | 4/15/2014 | standard class | jm15655 | jim mitchum | corporate | united states | glendale | arizona | 85301 | west | office supplies | paper | things to do today spiral book | 19.008 | 3 | 0.2 | 6.8904 | 0.362500 |
| 7 | CA-2014-100391 | OFF-PA-10001471 | 5/25/2014 | 5/29/2014 | standard class | bw11065 | barry weirich | consumer | united states | new york city | new york | 10035 | east | office supplies | paper | strathmore photo frame cards | 14.620 | 2 | 0.0 | 6.7252 | 0.460000 |
| 8 | CA-2014-100678 | FUR-CH-10002602 | 4/18/2014 | 4/22/2014 | standard class | km16720 | kunst miller | consumer | united states | houston | texas | 77095 | central | furniture | chairs | dmi arturo collection missionstyle design wood chair | 317.058 | 3 | 0.3 | -18.1176 | -0.057143 |
| 9 | CA-2014-100678 | OFF-AR-10001868 | 4/18/2014 | 4/22/2014 | standard class | km16720 | kunst miller | consumer | united states | houston | texas | 77095 | central | office supplies | art | prang dustless chalk sticks | 2.688 | 2 | 0.2 | 1.0080 | 0.375000 |
| Order ID | Product ID | Order Date | Ship Date | Ship Mode | Customer ID | Customer Name | Segment | Country | City | State | Postal Code | Region | Category | Sub-Category | Product Name | Sales | Quantity | Discount | Profit | Profit Margin | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9976 | US-2017-169488 | OFF-PA-10000157 | 9/7/2017 | 9/9/2017 | first class | aa10375 | allen armold | consumer | united states | providence | rhode island | 2908 | east | office supplies | paper | xerox 191 | 39.960 | 2 | 0.0 | 18.7812 | 0.470000 |
| 9977 | US-2017-169488 | OFF-PA-10002659 | 9/7/2017 | 9/9/2017 | first class | aa10375 | allen armold | consumer | united states | providence | rhode island | 2908 | east | office supplies | paper | avoid verbal orders carbonless minifold book | 16.900 | 5 | 0.0 | 7.7740 | 0.460000 |
| 9978 | US-2017-169502 | OFF-AP-10001947 | 8/28/2017 | 9/1/2017 | standard class | mg17650 | matthew grinstein | home office | united states | milwaukee | wisconsin | 53209 | central | office supplies | appliances | acco 6 outlet guardian premium plus surge suppressor | 91.600 | 5 | 0.0 | 26.5640 | 0.290000 |
| 9979 | US-2017-169502 | OFF-SU-10004115 | 8/28/2017 | 9/1/2017 | standard class | mg17650 | matthew grinstein | home office | united states | milwaukee | wisconsin | 53209 | central | office supplies | supplies | acme stainless steel office snips | 21.810 | 3 | 0.0 | 5.8887 | 0.270000 |
| 9980 | US-2017-169551 | FUR-BO-10001519 | 7/7/2017 | 7/9/2017 | first class | rl19615 | rob lucas | consumer | united states | philadelphia | pennsylvania | 19120 | east | furniture | bookcases | osullivan 3shelf heavyduty bookcases | 87.210 | 3 | 0.5 | -45.3492 | -0.520000 |
| 9981 | US-2017-169551 | OFF-PA-10004100 | 7/7/2017 | 7/9/2017 | first class | rl19615 | rob lucas | consumer | united states | philadelphia | pennsylvania | 19120 | east | office supplies | paper | xerox 216 | 15.552 | 3 | 0.2 | 5.4432 | 0.350000 |
| 9982 | US-2017-169551 | OFF-ST-10004835 | 7/7/2017 | 7/9/2017 | first class | rl19615 | rob lucas | consumer | united states | philadelphia | pennsylvania | 19120 | east | office supplies | storage | plastic stacking crates casters | 13.392 | 3 | 0.2 | 1.0044 | 0.075000 |
| 9983 | US-2017-169551 | TEC-AC-10002018 | 7/7/2017 | 7/9/2017 | first class | rl19615 | rob lucas | consumer | united states | philadelphia | pennsylvania | 19120 | east | technology | accessories | amazonbasics 3button usb wired mouse | 16.776 | 3 | 0.2 | 4.8231 | 0.287500 |
| 9984 | US-2017-169551 | TEC-AC-10003033 | 7/7/2017 | 7/9/2017 | first class | rl19615 | rob lucas | consumer | united states | philadelphia | pennsylvania | 19120 | east | technology | accessories | plantronics cs510 overthehead monaural wireless headset system | 527.920 | 2 | 0.2 | 85.7870 | 0.162500 |
| 9985 | US-2017-169551 | TEC-PH-10001363 | 7/7/2017 | 7/9/2017 | first class | rl19615 | rob lucas | consumer | united states | philadelphia | pennsylvania | 19120 | east | technology | phones | apple iphone 5s | 683.988 | 2 | 0.4 | -113.9980 | -0.166667 |